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COUNTING STRONGLY-CONNECTED, 
SPARSELY EDGED DIRECTED GRAPHS. 

Boris Pittel 
Ohio State University 



Abstract. A sharp asymptotic formula for the number of strongly connected di- 
graphs on n labelled vertices with m arcs, under a condition m—n — > oo , 771 = 0{n) , 

1/^ ' is obtained; this solves a problem posed by Wright back in 1977. Our formula is a 

counterpart of a classic asymptotic formula, due to Bender, Canfield and McKay, for 
the total number of connected undirected graphs on n vertices with m edges. A key 
ingredient of their proof was a recurrence equation for the connected graphs count 

r) . due to Wright. No analogue of Wright's recurrence seems to exist for digraphs. In a 

previous paper with Nick Wormald we rederived the BCM formula via counting two- 
connected graphs among the graphs of minimum degree 2 , at least. In this paper, 
Cu I using a similar embedding for directed graphs, we find an asymptotic formula, which 

includes an explicit error term, for the fraction of strongly-connected digraphs with 
parameters m and n among all such digraphs with positive in/out-degrees. 



(N 
> 

(N . . . , , . 

cr^ ■ 0. Intoduction. In a pioneering paper [24], Wright described two potent ap- 

^^ I proaches to calculating c{n, n + k) , the number of connected labeled graphs with 

10 ' n vertices and n + k edges. One was based on a classic exponential identity ex- 

pressing a bivariate generating function of all labeled graphs through that of the 
connected graphs. This identity lent itself to a recursive procedure well tailored 
for computing the exponential generating functions Wk{x) = ^^ x'^c{n, n + k)/n\ , 
and finding the asymptotic formulas for c{n,n + k) for small k, going beyond a 
kS I formula for the first nontrivial c{n,n + 1), due to Bagaev [1]. By representing a 

j^ ■ connected graph as a connected 2 -core with a forest of trees sprouting from the core 

vertices, Wright found an alternative way of computing Wk{x) for small k, and 
established a remarkable expression of Wk{x) through the exponential generating 
function of the rooted trees for all k . Neither of these approaches could be used to 
get asymptotics for k = k{n) — ;• 00. 

In [25] Wright described a quadratic recurrence equation for the numbers c{n, 
n + k) , and the related recurrence for the exponential generating function Wk{x) . 
The recurrence relates to the last, (n -|- k)-th, step of an edge-insertion process 
that begins at an empty graph on [n] ; at the terminal step a connected graph with 
n + k edges is born either if the penultimate graph is already connected and has 
n + k — 1 edges, or if it consists of two components with parameters {ni,ni -j- ki) , 
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2 BORIS PITTEL 

with ni+n2 = n, ki+k2 + l = k, and the (n+/c) -th edge joins the two components. 
Wright used the recurrence to obtain a sharp asymptotic formula for c{n, n + k) 
with k = o{n^'^) as n — ?• oo, a difficult result with many ramifications. 

BoUobas [6], [7] discovered that a leading factor in Wright's formula [25] was an 
upper bound for c{n,n + k) far beyond k = o{n^'^) , and used his bound to identify 
sharply — for the first time — a transition window in the Erdos-Renyi random edge- 
insertion graph process, [9]. Wright's results and enumerational insight were also 
a key ingredient in the subsequent studies of random graphs , Luczak [14], Luczak 
et al [16], Janson et al [12], Flajolet et al [10], Daude and Ravelomanana [8], Pittel 
and Yeum [22], Stepanov [23], to name just a few. 

In a monumental paper [3], Bender et al managed to use Wright's quadratic 
recurrence to derive an extension of Wright's asymptotic formula for k = 0(n In n) , 
i. e. all the way till the number of edges is so large that c{n,n + k) is asymptotic to 
the number of all graphs with parameters n and n + k. Later with Nick Wormald 
we found [20], [21] a more combinatorial, and less technical way to rederive the 
Bender-Canfield-McKay formula. It was based on Wright's decomposition device. 
First we counted the connected 2 -cores among all the graphs with minimum degree 
2, at least. After this step, enumeration of the connected graphs was reduced to 
enumeration of the cores with a forest of trees attached to the core vertices. More 
recently van der Hofstadt and Spencer [11] found yet another way to obtain the 
BCM formula. At the heart of their approach is a breadth-first search for connected 
components in the Bernoulli-type random graph G(n, P(edge) = p) ■ Interestingly, 
this algorithm had already been used by Karp [13] to obtain a sharp esimate of the 
largest strongly-connected subgraph size of the random digraph with P(arc) = c/n 
in a supercritical phase c > 1 . 

Wright also wrote a companion paper [26] on determination of g{n, 
n + k) , the total number of strongly-connected digraphs with n + k arcs. He 
was able to find a qualitative description of how g{n,n + k) depends on n , k, and 
to find explicit formulas for /c = 0, 1, 2 . We gather that he did not find a recurrence 
for g{n,n -\- k) that might have been used to study the case k = k{n) — t- oo. (We 
are not aware of any such recurrence either.) 

Aside from Bender et al [4] who found an elegant way to use their formula from [3] 
for enumeration of weakly-connected digraphs, the problem of asymptotic behavior 
of g{n, n + k) has remained open. 

Our goal in this paper is to obtain a sharp asymptotic formula for g{n, n + k) , 
with essentially best remainder term — a directed counterpart of the BCM formula — 
by using a proper version of the embedding device. 

Consider the digraphs on n labelled vertices, with m arcs. Let g{n,m) stand 
for the total number of strongly connected digraphs, with parameters n and m; 
thus m > n necessarily. Suppose r := m — n — > oo and m = 0{n) . We will show 
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X (1 + 0{r-^ In^ r + r^n-^/^+^)) , 

for any fixed e > 0, 7 € (0, 1/2) . Here Xe"^/{e'^ — 1) = m/n, and y is a positive 
Poisson variable with mean m/n, so that Var[y] = [m/n)(^X — r/rij . In particular, 
if r = 0(ni/2-'^) , cr > 0, then 

(2) ^(^,^) = ^_ (l + 0(r-Mn2r)). 

DTren \2r/ ^ ' 

Neglecting In r, the error term would be of order 1/r, which we believe is the 
correct order of the remainder term. Inevitably our proofs run parallel to the 
argument in [20]. There are quite a few unexpected challenges though, as the 
graph component notion morphs into two, harder-to-handle, dual notions of a sink- 
set and a source-set, the subsets of vertices with no arcs going outside, and no arcs 
coming from outside, respectively. 

Note. The author is well aware of a research started earlier by Xavier Perez 
and Nick Wormald. According to Wormald, (private communication), they expect 
a full argument to be less technical since their aim is a cruder version of (1), i.e. 
an explicit leading factor times 1 -|- o(l) . 

The author plans to use the results and the insights of this paper for a sharp 
analysis of a phase transition window for the directed counterpart of the Erdos- 
Renyi graph process. 

The rest of the paper is organized as follows. In Section 1 we state an asymptotic 
estimate for the number of digraphs with given in/out degrees (Theorem 1.1), 
analogous to those by Bender and Canfield [2], Bollobas [5], McKay [17], and McKay 
and Wormald [18] for graphs. In a proof sketch we introduce a random matching 
scheme similar to a random pairing introduced by Bollobas [5] for graphs. We 
use this estimate to prove an asymptotic formula for the number of digraphs with 
constrained in/out degrees (Theorem 1.2), and to formulate auxiliary bounds we 
use later. In Section 2, (Theorem 2.1) we prove a bound for the number of those 
digraphs without isolated cycles. In Section 3 we use Theorem 2.1 and the bounds 
from Section 1 to prove (Theorem 3.1) a sharp 0{n~^) bound for the fraction of 
digraphs with a complex "sink-set" ( "source-set" ) having less than half of all arcs, 
but without a simple sink-set (source-set). This implies that the fraction of strongly- 
connected digraphs differs from the fraction of digraphs without simple sink/source- 
sets by at most 0{n~^). Finally, in Section 4 we determine a sharp estimate for 
the latter fraction, which turns out to be Q{{m, — n)/n) ^ n~^ (Theorem 4.2), and 
this result yields the formula (1). 
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1. Enumerating the digraphs with restricted in/out-degrees. 

In this Section we provide a set of estimates, both crude and sharp, for the counts 
of all digraphs with (most of) their in/out-degrees being positive. 

Theorem 1.1. Let (5i, . . . ,(5^ > and Ai, . . . , A„ > 6e such that 
(1.1) J](5, = J]A, = m. 

i i 

where m > n. Introduce g{S,^), the total number of simple digraphs with in- 
degrees 5i and out-degrees 5i. If D := maxjmaxi 5i,maxj A^} = o{m}'^) , then 






(1-2) 9{ 

where the "fudge factor", always 1 at most, is given by 

(1.3) F{d, A) = exp f-l ^ 6A^ - ^ Y.(^.)2 ^^(A,)^ + OiD^m) 

ad)2:=d{d-l)). 

A combinatorial core of the proof is a random matching scheme, similar to a ran- 
dom pairing model for undirected graphs introduced by Bollobas [5] in his prob- 
abilistic proof of the Bender-Canfield [2] formula for the number of graphs with 
given, bounded, degrees. Consider two copies [m]i and [m]2 of the set [m], to- 
gether with the partitions [m]i = ^ili, [m]2 = ^jOj , \Ii\ = 6i, \0j\ = Aj . Each 
of m\ bijections vr : [m]2 — )• [m]i determines a directed multigraph G{tt): i -^ j 
is an arc if vr(z^) = /i for some v G Oj and fi £ li. Notice that, each simple di- 
graph G with in-degrees 6i and out-degrees Aj , corresponds to exactly ]^j(5i!Ai!) 
bijections vr. We call those bijections digraph-induced. Therefore F{S,A.) defined 
by (2) is the probability that the bijection vr chosen uniformly at random from 
among all ml bijections is digraph-induced, i.e. there is no i such that 7r(z^) = ^ 
for v £ Oi, iJ, & li, and there is no {i,j) such that 7r(z^t) = fit, t = 1,2, {vx^V2, 
/ii 7^ /i2 ), and Vt G Oj , \Jit^Oi. 

Since F{S,A) < 1 , we see that 



:i.4) g{d,A)<ml[ [] 



1 



5,;! A,! 



always. 

So the task is to show that this probability F{S, A) < 1 is given by (1.3). The 
sums in the exponent in (1.3) are asymptotic estimates for the expected number 
of loops and number of double arcs in G{Tr) . Our claim is a digraph analogue of 
McKay's far-reaching extension [17] (McKay and Wormald [18]), based on Bollobas' 
pairing model, of the Bender-Canfield formula for the number of undirected graphs 
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with a given degree sequence di, . . . ,dn, and maxidi = o{M'^'^). We omit the 
proof as it is very similar to those in [17] and [18]. 

Note 1.1. Notice that the RHS of (1.4) is not necessarily an integer, but 

\ie\n\ ) 

most certainly is! In fact, /i((5,A) counts the total number of ?Ti-long ordered 
sequences of insertions of single arcs into an initially empty digraph that result in 
a mti/ii-digraph with in-degrees 6 and out-degrees A . 

This formula follows from a bijection between that the set of those sequences 
and the set of pairs of ?7i-long words in the alphabet of n letters 1, . . . ,n, such 
that the first (second, resp.) word has 6i (A^, resp.) letters i. Observe also that, 
given /x = {^j^i}j^ig[„] with ||/2|| = ?ti, the number /i(//) of the sequences, that 
result in the multigraph with exactly /Uj^i arcs j — ?• i , is obviously the multinomial 
coefficient 

(1.5) M/.) = (^ 

On the other hand, ml g{d,A.) is the total number of the ?Ti-long sequences 
that result in a simple digraph with in-degrees S and out-degrees A . We call 
these special sequences graphic. Thus F{S, A) is the fraction of graphic sequences 
among all ?Ti-long insertion sequences leading to a multi-digraph with the prescribed 
in/out-degrees. 

McKay's formula was used by Pittel and Wormald [19] to derive a sharp estimate 
of Ck{n,m), the total number of graphs on [n] with m = O(nlnn) edges, and 
with minimum vertex degree k, at least, {k > 1). We use Theorem 1.1 to derive 
an asymptotic formula for Ci^i{n,m) , the total number of digraphs on [n] with 
mini (^i ^ 1 ) min A^ > 1 . In fact we will need a slightly more general set-up. 

Given rii < n, let n = (ni,n), and let Ci^i(n, ?ti) denote the total number of 
digraphs on [n] , with m arcs, such that 

(1.6) 5, > 1, (i e [n]); A, > 1, {j G [m]); A,- = 0, (j > m). 
By (1.4), 



Ci,i(n,m)<m! J] 11 ^Ta 



6,1 Ail 

S./\ meet (1.6) i^\n] 



--ml [x'^xT*] Yl n 

S,A meet (1.6) iG[n] 



<5i Ai 



X *X 



<5.!Ai! 

=m! [x™]/(x)" [xr]/(xi)"\ fiy):-- 
So, for all X > 0,xi > 0, 

(1.7) d i(n,m) < m! , 
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and the best values of x and xi are the minimum points of the first fraction and 
the second fraction, i.e. A and Ai , the roots of 

Aie''*^ m 
e^^ — 1 ni 



(1.8) 






Ae^ 
e^-1 


m 
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Using 


the 


Cauchy 


integral formula 








h 

dz, 



and an inequality 

\z\ — Re z 



le" - 1| < (e'"^' - 1) exp 
one can show easily that, in fact, 

(Here and elsewhere A <h B means that A = 0(B) uniformly over all parameters 
that determine the values of A and B .) So (1.7) can be strengthened to 

(1.9) Ci,i(n, m) <ft m! 



(l + nx)V2a;"* (l + niXi)V2a;7" 
for all x,xi > 0. In particular, 

(1.10) Ci,i{n,m)<bml—j^\^, Vx > 0. 

(1 + nx)x^'" 

The next theorem gives an asymptotically sharp formula for Ci^i(n, ?ti). 

Theorem 1.2. Let rii = Q{n) , r := m — n ^ oo , m = 0{n) . Introduce Y and 
Yi , two positive Poissons, with parameters A and Ai , i.e. 

AV?! Xi/j\ 

Then 

Ci i(n, m) = (1 + 0{r-^ + n-'/\')) m\ /^"/(-^i)"^ 
' ^ ^ ^ '^ (AAi)- 



27r(n Far [F]) 1/2 (n^ Var[Yi]Y/^ ' 
for every e > 0, where 

m AAi 
1.12 ^ = _ + _^. 

n 2 
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The idea of the proof is that, introducing the independent copies Y^ , . . . ,Y^ 
of Y and Y^,..., Y^"' and denoting Y = {Y\ ... ,Y'') , Yi = {Y^ , ... ,Yl'\n - 
rii zeroes), we can rewrite (1.2) in the following, suggestive, way: 



Ci i(n, m) =m\ 



f{\rf{\ 



\ni 



[x-^x"^' 






S,/\ meet (1.6) i€[n] '^ ^ ' J \ ^> 



/(A)"/(Ai 



(1.13) ^rn!M:4(^E 



F(Y,Yi)| ||Y|| = ||Yi|| =m 



(1.14) xP(||Y|| =m) P(||Yi|| =m). 
Here 

(1.15) E[F(Y,Yi)l{||Y|i=||Y,|l=w] =E[F(Z,Zi)], 
where, denoting i = (zi, . . . , i„) , j = (ji, . . . , jV 
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rr" i/i I 

(1.16) P(Z = i)= ^^-=1^- (||i||=m), 

E nt=i i/jt! 

and, denoting i = (ii,...,i„J, j = (ji,...,j„J, 

m^ 1/? ' 

(1.17) P(Zi = i)= ^^-=L„ - (||i||=m). 

E ni=i i/jt! 

The next step is to prove two similar cases of a local limit theorem 

P(iiY|i = m)= i + Q(^"^) 

^" " ^ (2.nVar[y])V2' 
P(||Yi||=m) ^"^^ ' 



(2^niVar[yi])V2' 

with the remainder term 0{r~^) . 

The last step is to prove that on the event {|Y| = |Yi| = m} , in probability 
F(Y,Yi) is within l + o(l) factor from the RHS of (1.3), where 5i, (5^)2, (i G [n]), 
are replaced with E[Y] and E[(y)2], and A^, (Ai)2, {i G [ni]), are replaced 
with E[yi] and E[(yi)2]. We omit the technical details as the argument is a 
natural modification of the proof of a corresponding formula in [19] for the total 
number of undirected graphs of mindegree /c > 1 , at least. We mention only that 
in (1.11) r^ comes from an observation that, with sufficiently high probability, 
maxjmaxiy*, m.aXjYj'} < r'^ , and that r~^ comes from the remainder term in 
(1.18). □ 

Note 1.2. Implicit in the theorem 1.2 and its proof is the following: for r — t- 00, 
m = 0{n) , 



:i.l9) [x""] (e^ - 1) 



e 



A 



1)" l + 0(r-^] 



A2™ (27rnVar[y])i/2- 
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Corollary 1.3. Let r — > oo, m = 0{n) . Then 

f(\\2n —r] 

(j^,„, ^"'"••"' = (1 + Of''"" + """'^••'» -' ^ ■ S^^iMTi 

m A 

7? = r]{n,m) := h — , 

n 2 

for every e > . 



Since Var[y] = e(A), Var[yi] = e(Ai), the bound (1.10) for Ci,i(n,m) with 
X = X, xi = Ai contains ah the key factors in the sharp estimate (1.15), except for 



Note 1.3 Recalling the note 1.1, m\Ci^i{n,m) is the total number of ?TT,-long 
insertion sequences s resulting in a digraph with non-zero in/out-degrees. Let 
hi^i{n,m) denote the total number of those sequences resulting in multi-digraphs 
with non-zero in/out-degrees. Dropping the fudge factor F , we have a counterpart 
of (1.13)-(1.15): 



X2m 



(1.21) /ii,i(n,m) = (m!)2 i^QL- x P(||Y|| = m) P(||Yi|| = m). 

Consequently, 



1.22 , , \' =EFZ,Zi , 

where Z, Zi are defined in (1.16)-(1.17), with ni = n this time. Now E[F(Z,Zi)1 
is asymptotically equivalent to e~^ , which is positive if r = 0{n). Thus, for r = 
0{n) the graphic ?Ti-long sequences constitute an asymptotically positive fraction 
among all m-long sequences of arc insertions. Moreover, from the proof of the 
theorem 1.2 it follows that, for r = 0{n) , 

Ijs : \F(S(s),A(s))-e-'^\ <r=n-i/2+7|| 
(1.22) IL I V V / V ^^ I J I 

>/ii,i(n,m)(l-0(e-'-^^"')), 

for all e > 0, 7 G (0, 1/2) , and some absolute constant c > 0. 

Section 2. Counting digraphs with restricted in/out-degrees that have 
no isolated cycles. 

Recall that, given ni < n, Ci^i{n,m) denotes the total number of digraphs with 
in-degrees of all vertices and outdegrees of the vertices in [ni] each being 1 , at 
least. Introduce Ci i{n,m) , the total number of these digraphs without isolated 
cycles. How much smaller is C°;^(n, tti) compared with d,i(n, tti)? 
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Theorem 2.1. // (n = {ni,n2),m) meet the conditions of Theorem 1.2, and ri 
m — ui = 0{r) , then 



(2.1) 



C? 1 (n, ?n) <b — Ci 1 (n, m) , r = m — n. 
n 



Proof of Theorem 2.1. Obviously, we may and will assume that r = o(n) . If so, 
in (1.10) f] = 0(1), see (1.11). Suppose a digraph is chosen uniformly at random 
from among all Ci^i(n, ttt,) digraphs. Let X denote the total number of isolated 
cycles in the random digraph. We need to show that P{X = 0) = 0{r/n) . 

Since the exponential generating function of directed cycles is 



e>2 



ln(l — x) 



the binomial moments of X are given by 



E 



k\ ^ 



ni\ Ci^i{n{a),m — a) 
Ci,i(n,m) 



a! X 




^ ^i?n,™(a)[x1 [Y,'' 



k\ ^ 

a<ni 



Rn,m{a) '■- 



£>2 



"-i!C'i^i(n(a),?7i — a) 
(ni — a)!Ci^i(n, 7TT,) 



(n(a) = (ni -0,71-2). 



Explanation: The binomial moment is the expected number of /c-long unordered 
tuples of isolated cycles. To generate those tuples in all digraphs in question we 
choose a vertex set of a generic cardinality a from [ni] in (^^^ ways, then form an 

unordered family of k disjoint cycles on these a vertices, in {a\/k\)[x°'](^^x^ /£) 
ways, and finally select an admissible complementary digraph on the remaining n—a 
vertices with m — a arcs, in Ci^i(n(a),m — a) ways. Furthermore, an admissible a 
also satisfies m — a < {n — aY , which implies that n — a > \fr . 

Consequently, by using an inversion formula 



P(^=i) 



k>j 






we obtain 

(2.2) 

where 



P(X = 0) 



E 

fc>0 



(-1)'=E 



J]«n,m(a)K]/i(x), 



h(x) 



exp 



^-^ X 



l>2 



- = 1 



x)e 
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So 



[x"]/i(x) 



a! (a-1)! 



, a > 1, [x°]/i(x) = 1. 



Let us find a sharp asymptotic formula for the sum in (2.2). Using (1.10) for 
Ci^i(n, m) and (1.8), with x = X, xi = Ai , for Ci^i{n{a),m — a) , we bound 



Rn,m{ci) <i 



{ni)a f AAi 



im)a V/(A)/(Ai) 



nrii 



(n — a)(ni — a) 



< 



n 



■ O" 



rii — a 



Here, by (1.7), 

(2.3) 

Introduce 

Consider a > Ar 



a 



Hi AAi 



Ae- 



m /(A)/(Ai) e^-1 
ln(n/r) 



< 1. 



An. — O 



lnln(n/r) 



Y, Rr.,m{a) [X'']h{x) 



a~>An 



(2.4) 



<i 



<i 



V- 1 



n 



yl„<a<ni/2 

1 n 

+ 



Lni/2J! 



^n! L«i/2J! 



<. (- 

n 



For 1 < a < An we need a sharp estimate Rn^miO') , within a factor 1 + 
0{r/n). Observe upfront that the asymptotic estimate (1-10) used separately for 
Ci^i(n, m) and Ci^i{n.{a)^m — a) would not work, as the remainder term 0{r~^ + 
n~^''^'^'^) is too big. (We already confronted a similar issue in [20]; the hurdle 
is higher this time, as in [20] we were content to have an error term of order 
o(r/nY''^\ , rather than 0{r/n). For the next theorem, at a similar point we will 
even need o{r/n) .) 

Instead of (1.10), we use the exact formulas (1.12)-(1.13) for both Ci^i{n,m) and 
Ci,i(n(a), 

m — a), using A and Ai , the roots of (1.7), for both Ci^i(n,m) and Ci^i(n(a),r?T,— 
a) . So 



(2.5) R^,^{a) 



{ni)a f AAi 
(m)„ V/(A)/(Ai: 



E, 



ii{a),m — a 






m — a 



m) 






m — a 



m) 
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where E^^rn = £^n(a),m-a|„=o' (^^'^ (1-14)-(1.16)) 



(2.6) -Bn(a),m-a = E 



^ ^ rii—a 

lB(a)exp V Z^(a)Zi(a) 



m — a 

i = l 



0(P(S(a)^)), 



^ ni —a 

i^(rai-a,ni] J = l 

+ 0(m-^ max(Z^(a) + ^^(0))^)) 

and 

B{a) := {max(Z^(a) + Z{{a)) < oj}, 

with Lo = o{m~^'^) to be specified shortly. Our notation emphasizes dependence of 
Z^'s on a: for instance, Z^{a), . . . , Z'^~°'{a) are occupancy numbers in the uniformly 
random allocation of m — a distinguishable balls among n — a boxes, subject to 
the condition "no box is empty" . 

First, let us dispense with the ratios of local probabilities. In [20] the follow- 
ing estimate was proved. Let Y^,. . . ,Y^ be independent Poissons distributed on 
{2, 3, . . . } , such that E[y-'] = 2m/ n. Then, for a = o{n) , 

— ^ — j^irz^ ■ r — - = l + 0[an + a rn ), r:=2(m — n). 

No real changes are needed to show that for our Poissons y^, . . . , Y"^ (distributed 
on {1,2,...}), 



V{Y.UY^=m) 



(2.7) ,t^n ■ r-^ = 1 + 0(an"^ + a^rn"^), r -.= 7x1-71, 



and likewise 

/ ■ — = l + 0{an +arin ), ri=m-ni. 

P(Ej=i^i =^) 

Since ri = 0{r) we conclude that 

v[Y:aY^ = m-a) p(E;ir>?' = ^-«) 

(2.8) ^{T.:=iY^ = m) ' V{Yr,UYl=m) 

= l + 0{an~^ + a^rn~^). 

Next we focus on En(a),m-a/En^m ■ Let us look at m.axi(^(^ni-a,ni] Z^{a) . For J > 1 , 
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by (1.15), 



PiZ{a)=j] 



<i 



i i>0: ||i||=m — a— J 

^ E nr=7i/j*! ' 

j>0: ||j||=m-a 
1 !j,m — a—j'\f„x 1 \n — a — 1 

1 (e^ - l)"-'*-V[A'"-'*-V(^ - a - 1)A 



j! (e^ - 1)"-"/ [A'"-"^(n-o)A] 

1 A^ 



j! e-^ — 1 



As A < 2r/n, we see then that P(Z(a) > j) <b {2r/ny . Therefore, picking e > 0, 

/2r 

P(maxZVa) > r'') <n\ — 

i \ n 

<b— exp[lnn — r^ ln(n/(2r))] 

(2.9) ^- - ' 

n \n/ 

since, for r — )■ oo and r = o{n) , 

r^ ln(n/r) ^ Inn. 
And on the event {maxi Z^{a) < r'^} , we have 

(2.10) max.Z-(a)^^^r^^r^,,_,^^r^ 

771 n n n 

provided that e < 1/4. Obviously the counterparts of (2.9), (2.10) hold for 
maXj Z^l (a) as well. 

So, setting uj = r^ in (2.6), 



i^n(a),„^-a = 0{{r/nf/'+') + (l + 0(r^7n)) 



xE 



ni —a 



exp 

m — a 

1=1 



— Y. Z\a)Zl{a) 

— n ^ — ^ 



i^(rai— a,ni] J — 1 

and, by E[e^] > e^t'^J , the last expected value is bounded below by 

/ ni — a , , ,, , , ,, (n — a)(ni —a) ,, , ss -, r/ . vv - 

exp -^ E Z a E Zi a - '•>'• \ ^ E Z a 2 E Zi a 2. 

\ m — a 2(m — or 
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which is bounded away from 0. Therefore 

^ I; Z\a)Zl{a) 



n(a),m-a = l,l + 0((r/n) ■ T 7" / 'V; -C/„(a),m-a' 

ni —a 

expl 



^n(a),m-a •— ^ 



m — a 



i = l 



1 



ni—a 



2(m- ay 



Y^ {Z%a)h Yl {Zi{a)h 



i^(ni—a,ni] j — 1 

It remains to consider E* , ^ /i?*/n^ • Notice that, conditioned on the event 

A = {Z\0) = 1, i G (m - a,ni]} n {Zj(0) = 1, i e (ni - a,ni]}, 

{Z*(0), ^i(O) ; i ^ (ni — a, ni], j < ni — a} has the same distribution as {Z*(a), 
Zl{a) : i ^ (ni — a, ni], J < rai — a} , and 

P(^) > 1 - a{F{Z{0) > 1) + P(^i(0) > 0)) = 1 - O(aA) = 1 - 0{ar/n). 

Therefore 

K(a),m-a = (l + 0(ar/n))^;^„)^^_„, 



^n(a),m-a •— -^^ 



exp 



ni— a 

— Y Z\0)ZI{0) 



m — a 



i=l 



(2.111 



2{m — a)^ 



i^{ni—a,ni] j = l 



The contribution to the expectation E^*^s ^_^ from the random outcomes with 
maxi,j(Z*(0) + Z({0)) > r^ is of order 

P(max(Z^(0) + Zi{0)) > r^) < {r/nY'/^+\ 

If maxi^j(Z^*(0) + Zl{0)) < r^ , then the difference between the random exponents 
in (2.11) for o = and o > is, by simple algebra, of order ar'^^/n. So i?*L) m-a 
is within the multiplicative factor 

(1 + 0{{r/nf/^+') [exp(0(r'^7n))]'' 

away from E\^-. ^ . Collecting the pieces we obtain 



(2.12) 



E, 



n(a),7TT, — a 



[l + 0{ar/n)) [exp(0(r'^7n))]° 



uniformly for 1 < a < An . 
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Combining (2.5), (2.8) and (2.12), we conclude: uniformly for 1 < a < A^, 

im)a \f[X)f[Xi) J 

,2 „2 



, , ,^^'^,^ .O(-^Vn)') .expf-^ + ^'l (l + 0(ar/n)) 
(m/ni)/(A)/(Ai) ^ ^ V 2ni 2^; ^ ^ ^ ^^ 

= [aeO('^"/")]"(l + 0(ar/n)), 



see (2.3) for the definition of a . Since a = 1 — Q{r/n) , and e < 1/4, 
Therefore, invoking the inversion formula (2.2) and (2.4), and using 

^-^^TTT<0, Va>l, 
a! (a — 1)\ 

we see that P(X = 0) is bounded above and below by 

1 + E (^ - uhv) [^^""^'"^"Tll + 0(ar/n)) + 0{{r/nf) 

a=l ^ ^ '^ ^ 



(l_^eO('^^V"))e- + o 



n^ V(a-l)! a! 

a>l 



+ 0((r/n) 



(1 - ae^^'^'^/")) e-^'"'"'''"' + 0(r/n) = 0{r/n). 



This completes the proof of Theorem 2.1. D 

Note 2.1. Using Theorem 2.1 we will prove in the next section 3, Theorem 3.1, 
that it is quite unlikely that the random digraph with non-zero in/out-degrees has 
no isolated cycles and no sink/source-set with fewer than m/2 arcs. In Section 4, we 
will use the proof of Theorem 2.1 as a rough template for proving a genuinely sharp 
asymptotic estimate for the probability of non-existence of "simple sink/source- 
sets" in the random digraph with non-zero in/out-degrees. This estimate coupled 
with Theorem 3.1 will deliver an asymptotic fraction of strongly-connected digraphs 
among all such digraphs. 

3. Bounding the number of the digraphs w^ithout simple sink-sets and 
small complex sink-sets. 

Let Gi^i{n,m) denote a digraph on [n] which is chosen uniformly at random from 
all such digraphs with m arcs and with the smallest in-degree and the smallest 
out-degree both at least 1. We call 5 C [n] a sink-set (source-set resp.) if there is 
no arc i ^ j (j ^ i resp.) for i £ S , j ^ S . A digraph is strongly-connected iff 
it has no proper sink-sets and no proper source-sets. We call a sink-set (source-set 
resp.) S simple, if all the out-degrees (in-degrees) of Gs , the subdigraph induced 
by S , are equal 1, and complex otherwise. Our first step is to the following result. 
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Theorem 3.1. Suppose r := m — n ^ oo and r = 0{n) . Let An,m denote the 
event "Gi^i{n,m) has a complex sink-set containing at most ni/2 arcs, and has no 
simple sink-set". Then P{An,m) = 0{n~^) . 

Proof of Theorem 3.1. We will focus on a core case r = o{n) , and at the end of 
the proof we will briefly discuss how to handle r = 0(n) . 

Given u £ [3, n) , v < fj, < niQ , let X^^^ denote the total number partitions 
[n] = A^tl B , {\A\ = v), such that (1) A is a minimal complex sink-set with ^ 
induced arcs, and (2) neither Ga nor Gb contain a simple sink-set or source set. 
(Minimal means that A does not contain a smaller complex sink-set.) Let £ > 
be fixed. Set 



(3.1) uo 

clearly fo = o{n) . Define 



2\ 1/3 

mn \ n 






2^2/3 ■' 



(3.2) X= Y. ^--M + Y. ^-^y 



f<i-'Q,/i<m/2 i-'>fQ,/i<m./2 



In both sums v and /i are subject to an additional condition, fi — v < r: as we 
shall see shortly, Xi,^^ = otherwise. The theorem will be proven when we show 
that E[X] = 0{n-^). 

For this we will use the following bound: 

r 

n 

(3.3) ^ (^) ^^l-l' (^x _ lY A^^e'^^ 



C") ^i{^l-u)\ x^' (e^-l)2'^' 

uniformly for all u < n, ni — fi > cm, c € (0, 1) being fixed, and a; > 0. 

To prove (3.3), let us bound the total number of partitions {A, B) in question in 
all the digraphs with parameters n , ttt, , of minimum in-degree and minimum out- 
degree 1 at least. A set A of cardinality v can be chosen in (^) ways. By symmetry, 
we may consider A = [v\, and we will use [n — v\ to denote i? = {z^-|-l,...,n}. 
Let us bound the total number of Ga's. Introduce R, the set consisting of 1 and 
all vertices j G [n] reachable from i by directed paths. R<1 A a,s A is a sink-set. 
Since i? is a sink-set itself, R = Ahy minimality of A . Thus there exists a directed 
tree T rooted at 1 that spans A, with arcs oriented away from the vertex 1. This 
spanning tree is such that the out-degree of 1 equals its the out-degree of 1 in Ga 
itself. Given 

(3.4) di>l, d2,...,d^>0, (di-l) + d2 + --- + d^ = i/-2, 

there are 

(1.-2)! 



(cii-l)!n.>2rf.! 



16 BORIS PITTEL 

rooted trees T with outdegrees di, . . . ,d^ . The digraph Ga is a disjoint union 
of T and a complementary digraph H on [i^]. 6i, Ai, (i £ [u]), respectively the 
in-degrees and the out-degrees of H , must meet the constraints 

(3.5) J]5, = J]A, = ^-i. + l, 

i i 

(3.6) di > 1, 5i > 1, Ai =0; Ai + di > I, {i > 2). 

(All in-degrees in H , except the root 1, are positive, whence the single condition 
Si > 1 ensures that TUif has all its in-degrees positive. The condition Ai + di > 1 , 
redundant for the root 1 , ensures that T U H has all its out-degrees positive as 
well.) The total number of H''s with the in-degrees 5i and the out-degrees A^ is 
{fi — ly + 1)1/ Y\iiSi^-Ai\) , at most. Hence the total number of Ga 's, with A = [i^], 
is on the order of 

(.-2)!(^-. + l)!^n (,._i,.')„.,Ar 

with d, 5, A meeting the constraints (3.4)-(3.6). Using the generating functions, 

^ i-l- (d- - If iiV5'A' 



ii(d.-l{,=i^)!5.!A,! 



meet (3.6) 



[yM— +1] ^^y _ i)e^—i)y 

u-l 



X X 



X"- ^z^ \ I s-^ x'^z^ 



-"^'-""1 E -^\A E 



d!A! ^ dlA\ 

,d>l,A=0 / \d-|-A>l 



Here 



[yM-^+i] (e^/ _ i)e('^-i)2' X [x'^-^z^-'^+i] e^(e^+^ - 1)'^"^ 



[yM-^+i] (e« _ i)e('^-i)?' < [y^-'^+i] ye^^ = [y^-^e'^^ 



(^-^)! 



and 



(^_2)!(^-z. + l)!^'^ Je ^e ') ' 
where 



c^- V(e" - 1)"-^ < ^ _/ , (Vx>0) 



, (e^ - 1)'^ 
x/^ 
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Collecting the pieces, we bound the total number of Ga 's with \A\ = z/ by 

<(x+i)r"V7'^/>'<-^-"". 

Turn to the complementary subgraphs Gb, B = [z/]^ := [n — v\. Let (5^, A^ , 
i G [n — z/] , denote in-degrees and out-degrees of G[„_,y] , and let di denote the total 
number of arcs emanating from i € [n — v] and ending at a vertex in [u\ . Obviously 

(3.8) 5i>l; a, + A, >1, ie[n-u], 

(3.9) Yl 6^= Y^ Ai = m-fi-d. 

Let us bound the total number of the admissible G^^-u] ^or given di, i € [n — i^]. 
The second condition in (3.8) simplifies to 

Ai > 1, i€ I{a) := {i (E [n - u] : di = 0}. 

Clearly, 

|/(cr)| >n-u-d, d:= ^ di. 

iG[n — ^] 

By the first condition in (3.8), n — i' < m — fj, — d; in particular, ji — v<m — n = r. 
Actually n — v<m — ^ — d, since otherwise all Si = 1 and G[n-i^] would contain 
an isolated cycle, a special case of a simple sink-set. So 9 < r — (// — z/) < r , and 

I [n — jy] \ /(<t) I <r<^n — v, 

i.e. out-degrees A^ cannot be zero for all but at most r specified vertices. Let 
Ccr {m — /U — (9, n — z/) denote the total number of digraphs on [n — v] with m — ji — d 
arcs such that 

(3.10) <5. >1, (ie [n-z/]); A, > 1, (i G /(<t)). 
Further, 

G^{m- fi-d,n-u)<{m- fi-dy. ^ Yl ^1a1' 

<5,A meet (3. 8), (3. 9) ieln-u] *' *' 

And, by the theorem 2.1, the total number of G[„_^] is on the order of 

m-fi-{n-u-d) \ ^ "^ n I a \ 

Ca-iTn — u — o,n — ly] <h — Ca-(m — u — o,n — v). 

n — V n 

Also, the total number of ways to choose in [z^] the partners of the vertices in [n — z^] 



IS 
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Therefore, for a given d, the total number of the subdigraphs that complement 
G[^j is on the order of 

5, A meet (3. 8), (3. 9) iG[n-u] ^ ^ ^ 

(\ n — iy / \ n — u 

n \ a ) 

<b - (m - ^ ! — n - ;/ n - / ^ ((n - ;/ w " ^ ^ -^^, 

for all u > , It; > . 

Setting u = w = X, and summing for 9 > , we obtain an overall bound for the 
count of complementary subdigraphs: 

(3.11) -(nA) (m-^)!n ^^ ^(^_^) 

\9>0 ■ / 

^-(,a)-(m-M)! ' ^,,'„_^, . 

(We have used n — z^ = 0(n) .) 

The total number of the partitions (A, B) is bounded above by the product of 
(3.7), (3.11) and (") . Dividing this product by the total number of the digraphs 
in question, we arrive at the bound (3.3). 

To get the most out of (3.3), we will use x = x{v, ^) , the minimum point of 

h{v, fi, x) = 2v ln(e^ — 1) — 2yu In x, 

i.e. the root of 

(3.12) /i,(i/, /i, x) = 2u-^ - ^ = 0. 

e^ — 1 X 

Considering v and fi as continuously varying, we compute the partial derivatives 

Xu = ^ 5(a;), Xf, = - g{x), 

(3.13) (e--l)2 

g{x) = — -, 

Since ^(0+) = 2 and g{+oo) = 1 , it follows that x{u,ii) = Q{{^ — u)/u) , uniformly 
for all < z^ < /U. In particular, in (3.3) the factor (x + 1)/^ is 0{l/v)] so we 
replace x + 1 with 1/v . (It is easy to show also that x(z/, /i) > 2(/x — z^)/yU-) 
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Consider ly < i^o a-nd ^^ < ^ < m/2. By (3.3) and (3.13), 

■ exp[h{i', fi, x{i', jj,)) — h^u, fi — l,x{i', fi — 1))] 
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(/iG [/i- 1,H, x = x(/i,i/)) 



:A' 



(3.14) 



2 ^ ^ 

m — fi fi — u 

= \^-^ ^exp(-lnx)<A2 

m — fi fi — V 

<bX^ x~'^{v,fi- 1); 

m — fi 



exp[/i^(z^,/i,x) + h^{u,fl,x)x^{u,fl)\ 



^ 



m — fi {fi — 1) — V 



exp(— Inx) 



{x{v,fi) increases with /i, ^^— ^ = (j){x{i',fi)) where 



(x):= 



xe 



e^ - 1 



l = e(x), 



uniformly for x > 0.) Let us show that the last expression decreases with fi. Using 
the formula for x^ in (3.13) for fi — 1 instead of /Li, we compute 



d f fi 



X "^{v^fi- 1) 



< 



dfi \m — fi 

m 

[m — fi)'^x'^ {m — fi) V x^e^{e^ — l — x) 
2 xe^ (e^ - 1)2 



2 fi (e^ - 1)^ 



(x := x{iy,fi- 1)) 



m 



(m — fi)'^x'^ {m — fi) e^ — 1 x'^e^{e^ — 1 — x) 



1 



x^{m — fi)'^ 



m — 2{m — fi) 



e^-1 
e^ — 1 — X 



<0, (!) 



as, fi < m/2. Thus indeed the RHS in (3.14) decreases with fi. At fi = u + 2, for 
jy < no = e(mv? jr^Yl'^ , this RHS is 



A^ - + 2 



x-2(^,^ + l)^ A^^f-) (1 + 0(1H) 
m — [ly + 2) m \u J 

= f-Hl + OilM) = e^ l!^!I^(i + 0(1/,)) < 2e\ 
im 4:171 r^ 

if ly exceeds a large enough v* . The same bound holds trivially for v < ly* , as 
x{iy, z^ + 1) > 2/{u + 1) . Hence 



E, 



E 
"'^ <b 2e^ < ^, {iy<na, i' + 2<fi< m/2), 



V^fl — l 



1 

2' 



provided that e > is chosen sufficiently small. Consequently, denoting x 
x{iy, z^ + 1) and using 



Ki^+iJ 



<h 



ly / n 



m \m 



n Ae e^ — 1 ^ ^ 

m c^ — 1' X i^ + l 



32/^^(1 + 0(l/z/)) 
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we bound 


'- E '^"'^-M<^E'^"'^-+i E^-(V3rM 


.,„;,r.™/. '^^"° V^-^1 / 




-(e^_l)2 ^2gA ■ 


V 


x2 (e^-l)2. 




rX^^.fXy r\^ ( A 


n m ^ \e^ — 


ly/ " n m ^" e^ 


-\ 



Thus 



(3.15) 



:0((r/n)(mA)-i) = O(n-i). 



Si:=- V ii?.,^ = 0(n-i) 



It remains to bound 



^2 :— — 7 ^ ^v,u- 
n ^-^ 

Since fc! = e(fci/2(^/g)fc) ^ ^^ ^^^^ ^^^ (33) ^^ j^g^^g 

T ^ 1 

(3.16) 
where 



re + 1 1 

-E^.A* <b exp{H{u,fi,x)) <b — exp(F(zy,^,x)), 



Hn,m{^-, /U, x) = H{v^ n,x) := Tilii Ti — v liii' — {u — v) ln(n — v) 

— mlnm + filn fi + [m — fi) ln{m — /u) 
(3.17) +z.(ln(e-_l)_21n(e^-l) + A) 



+ {fi — ly) In i^ — (fi — v) In 

e 

+ ^(21nA-lnx). 

We will use H^, , H^ , H^ to denote the partial derivatives of H^iy, fi, x) . Like 
before, given v < fi, we choose x = x{iy, n) , the root of (3.12), or equivalently the 
root of Hx{iy, n,x) = 0. Then 



dH{u,ii,x{u,^x)) 
dv 



(3.18) 



:H^{iy,fi,x{iy,fi)) 

■ — In ;/ + ln(n — z/) + In 



^x(u,^i) _ ^ 
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+ A 



fi — u 



Inz/ + ln(^ — z/). 
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By (3.13), x{i^, n) strictly decreases with v , and then so does dH{v,ii,x{v,^))/di' . 
That is, as a function of i^, H{v,^^x{u^ ^)) is convex. In fact, using x^(i^, /i) < 
and n — i^ < r , we have 

d'^H{v,ii,x{v,ix)) _ dHi,{u,ji,x{v,ix)) 
(3.19) 





du^ du 








1 


1 ^ e^('''^)x^(i/,/i) 
n - z/ e^('^'^) - 1 


' 1/2 " 


1 
u 


1 


V 


^ — u 


1 










r 











Since x(z^, ^) i as z^ t M ; a-^id x(z/, /i) f oo as z/ | , there exists a unique z/(/u) , 
the root of dH{i', fi,x{i', n))/di' = 0, at which H^u, fj,,x{iy, n)) attains its absolute 
maximum /(^) := H{i^{fi), iJ,,x{iy{n), fi)) . Observe immediately that the equation 
Hjy{i', fijxi^i', iJ,)) = 0, with H,y{u, ij,,x{iy, n)) given by (3.18), and (3.17) allow us to 
obtain a much simpler expression for /(/i) , namely 

f{fi) = — nln(l — i^/n) + ^In/i + (m — /u) ln(?Ti — /i) — mlnm 

(3.20) , X^jy 

+ yuln 



x{fi — u) 



Let us have a close look at z^(/i) and x(z/(/i),//) . 
Observe first that, by (3.13), 



— =X^ + X^Uf, = 1 



(3.21) ''^'^:::^" =x,+x^., = '-^{i-^u. 



Differentiating dH{u^ ^,x{u,^))/du = with respect to fi, using (3.13) and solving 
for u^ , we obtain 

(e" - l)-'g{x)/u 

'^^ 2z/-i + (n - z/)-i + (^ - z/)-i + ^z/-2 + (e^ - l)-^g{x)fii^-'^ ' 

So t'^(^) € (0, z/(//)//^) , i.e. z/(/^) and, by (30), x(z/(/^),//) both strictly increase 
with /i. 

Turning the tables around, introduce the functions i/(x) and /u(x) determined 
by two equations, (3.12) and Hjy{h', fj,,x) = 0, (see (3.18) for H^{u, ^,x)): 



(3.22) V = u{x) := n ^ ^ , /i = /^(x) := z/(x) 



where 0(x) = xe^/(e^ — 1) — 1 . Since z^(^) , x{iy{p), fi) are strictly increasing, then 
so are z/(x) and ^(x) . 

Using the parameterization i^{x), /i(x), let us evaluate sharply F{x) := f{p,{x)) 
= if (z^(x),//(x),x) for X < cA, with c > to be specified shortly. After some 
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algebra, (3.22) becomes 

_ 7x/6 - A fl _ /J , a;-A ^ 

^'- l + 0.5(x/A)2' ^2-^1 + ^-, 

(we have used m/n = 1 + A/2 + 0(A^)). In particular, /x(x) is of order m{x/X)'^ 
exactly. Plugging these expressions into (3.20) we obtain, after massive simplifica- 
tions, 

F{x) = -r G{x/\) + 0{mx^), 

(3.24) .3 ,2 

G(z):=ln(l + 0.5z2^ 



l + 0.5z2 1 + 0.5^2' 

(r = m -n). G{z) > for z G (0,zo], zq = 1.772, and G{z) ~ 1.5^^ as z i 0. 
For xq := Xzq , it follows from (3.23) that 

mi := ^(xo) = m — 2_(i + (9(A)) > 0.61m. 

1 + ^0 

Returning to the pameterization lyip), x{i^{fi), ii) , we see that x(z/(/i),^) < xq for 
/i < m-o = 0.61m, and 

/(^) =F(x(z.(m),^)) < -C2r (^^OMlI^y + 0{mx') 

(3.25) = -C3r-^ + 0(;uA2)<-C4r-^, 

m. m 

as A = 0{r/m) . 

With these preparations out of the way, turn directly to S2 . Given ^ G [uq + 
l,mi], (no = e(n/r2/3)), by (3.16), (3.19) and (3.25), 

\2- 



Ern -if ^\ V^ f (^ ~ '^(/")) \ 
E,,^, <b v^ exp (-C4r- j x 2^ exp ( -ci 1 

<bZ/o"^exp (-C4r— ) / 
V m/ J_, 



r-oo 



DO 

<6 n~^r^ '^ exp ( — C4r — 



Therefore 



jy» ,>-. J- O / O ^ , , 

(3.26) j.13/6 exp(— C4rz^o/'^) 

-^ rfl 1 - e-^4'-/m 

.7/6 



<b^ e-'^^'-'"' = 0(l/n). 
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Combining (3.26) and (3.15), we conclude that E[X] = 0{n~^). This completes 
the proof of Theorem 3.1 for the case r = o{n) . Since r/n — t- however slowly, 
we actually established our claim for r — > oo such that r < en, where e > is 
sufficiently small. 

Turn to r = G(n). Up to and including the equations (3.22) the proof is ba- 
sically the same. Furthermore, for x < xq, xq being small enough, Fn,m{x) = 
H{v[x),^{x),x) = — n©(x^). The challenge is to show that Fn^mi^) = — 0(n) for 
all X > Xq, such that ^{x) < m/2. This is definitely so for r < en. Let 



c 



* = supjc > e : Fn,n+cn{x) < if X > Xq and /i(x) < m/2}. 



If c* < oo, then for m = n + c*n there exists x* > xq such that /u(x*) < m/2 
and Fn^rn{x*) = 0, F^„(x*) = 0. Equivalently, we must have fn,m{P') = 0, 
fii,mil^) = for some < ;u < m/2, with fn,m{fJ') = /(/«) defined by (3.20). 
Since H^ifi, u,x{v, ^)) = 0, Hi,{i'{ji), ^,x{v, ^)) = 0, by (3.17) we have 

(3.27) f'^^^iix) = H^{^l,u{^J:),x{u{^J:),,x)) = \n(-^ ^-"j =o, 

\ni — n fi — jy X J 
so that, denoting x = x{u{ji), ^) , 



(3.28) 11 = ^^, </.(x) = -^-^ - 1. 

1 + 



m , , , xe 

1? 



x(j){x) 



Since ^ < m/2, it must be true that A^/x(/)(x) > 1, so that A > 4>{x) , as 4>{x) < x 
for X > 0. Furthermore, combining (3.27) and (3.20), we obtain 

(3.29) /„,^(^) = -nln(l-z/(^)/n) + mln(l-^/m) = 0. (!) 

Combining (3.28)-(3.29) with (3.22), and denoting ^(x) = xe^/(e^ ~ 1)) we see 
that there must exist a solution (A, x) of the following system of two equations: 



e 



</,(x)e'^^^^ = ( 1 



1 ^r^^M^) _ f, , a;0(^) 



A_ip^w V A2 



(3.30) — L(y,fa;)e<^(^) 

^ ' V'(A) 1 _ (6^-1)2^^^^"^ 



x0(x) (e-^ - ly 

Our task is to show that assuming existence of such a solution (A, x) we get a 
contradiction. 

Since ip{X) > 1, the first equation in (3.30) implies that 



Ji^,„e*) >! + £«-). 



(3-31) 1 + /> ,V; H^y"^ > 1 + ^^^'/■(A), 
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or, using the definition of ip{-) , 

X A 

As (/)(x) < A, and (e^ — l)/y is increasing, it follows that x > A. By (3.31), the 
second equation yields an inequality 

V^(A) 1 ^ -j^^W 

X(p[X) AT 

or 

A2 \ A2 



\ x(p[x) J X(p{X) 

or 

-|— (V'(x)-l) <V'(A)-V(x). 

This is impossible since ^p{x) > 1 and '0(A) < ip{x) . 

Thus, for r = 0(n) , Fn^m{x) = — n0(x^) for x < xq , and Fn^mix) = — 0(n), 
for all X > Xq such that /i(x) < ?ti/2. 

With this property established, the bound for S2 can be proved in essentially 
the same was as for r = o{n) . D 

Note 3.2. If a digraph with positive in/out degrees has a sink-set (a source- 
set) T with \T\ < n, then there is a source-set (a sink-set) T ^ T'^ . (If, say, 
T is a source-set, then, for a vertex v £ T'^ , the set consisting of v and all ver- 
tices reachable from w is a sink-set in T^.) Therefore if a digraph of this sort 
is not strongly-connected and has no simple sink/source-sets, it must have either 
a complex sink-set or a complex source-set with at most m/2 induced arcs. By 
Theorem 3.1, the probability of this event is 0{n~^). Thus, it remains to find a 
sharp estimate for the probability of no simple sink/source-sets and to check that 
this probability far exceeds n~^ . 

4. Counting the digraphs v^^ithout simple sink/source-sets. 

A subgraph induced by a simple sink-set (source-set resp.) is a disjoint union of 
cycles. So our task in this section is to find a sharp asymptotic formula for the 
probability that Gi^i{n,m) has no induced cycles C such that there are no arcs 
i ^ J (i ~^ ^ resp.) with i G C and j ^ C. 

Consider first the corresponding probability for the random multi-digraph 
MGi^i{n, m) induced by the ?7i-long insertion sequence chosen uniformly at random 
among all hi^i{n,m) such sequences. (See Note 1.2 and (1.17) for the definition 
of hi^i{n,m) .) Let X = X^ denote the total length of the cycles in all simple 
sink-sets and source-sets. 
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Lemma 4.1. 






(4.1) 



where 



hx,\{n,m) ^ {m)a 



i=l 



1 — yz T-r 1 — ye^^ 



y :^-^ i — yze-^ 



(4.2) if(x,y,z) = ^-^nT 
In particular, 

(4.3) P(X = 0) = i?[0^] = -A^^^V, , 



(m!)^ N-^ in] 



[(xiX2)'"-v] (1 - y)-' n(i - ye"')(e"' - 1)' 



i=l 



Proof of Lemma 4.1. Let us first derive a formula for E[(j,)] . (^) is the 
total number of unordered k tuples of vertices belonging to the cycles of simple 
sink/source-sets. By symmetry, E[(^)] equals (^) times the probability that the 
cycles contain vertices from [k] . To evaluate this probability we need to count the 
number of m-long insertion sequences that result in multi-digraphs in which the 
set [k] belongs to those cycles. We find it a bit easier to turn things around. First, 
we evaluate the number of the insertion sequences having several such (disjoint) 
cycles induced by a subset of vertices. Second, we multiply this number by the 
total number of ways to select a set of k points contained in these cycles, under 
a constraint that every cycle is represented in such a set. Third, we sum these 
products over all those collections of disjoint sets. Finally we divide the sum by 
/ii^i(n, m) , the total number of the insertion sequences. 

Let a denote a (generic) cardinality of a chosen set A of vertices; there are 
(") such sets. Let ai^i, . . . , ai^e^ {0.2,1, ■ ■ ■ , 0,2,12 ' resp.) denote the lengths of sink 
(source, resp.) cycles to be formed out of the a vertices. As loops allowed, Uij > 1 
and 

2 e^ 

(4.4) Yl Yl "^'i = "■ 

i=i j=i 

The total number of ways to form ii + £2 directed cycles is 

(4.5) _^TtK.-1)!^^L-|T^. 
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The total number of ways to select k points from these cycles subject to the rep- 
resentation constraint is 

E n(::;)=[-'in 

k^j>l: ||k||=fc i,j ^ '■'^ 

(4.6) =[w'']Yl[{l + wr^^^ -1]. 

The product of the counts in (4.5) and (4.6) is 

(4.7) j^^il + wT:^^ 

I, J '■> 

Given these cycles Ci^i, . . . , Ci^£^, C2,i, . . . ,C2,^2 > let 5^, 1 < i < £i , denote the 
number of arcs u ^ v with u ^ Ci^i and v G Ci^i , and let A^ , 1 < i < £2, denote 
the number of arcs u ^ v with u G 6*2,^ and v ^ C2,i. Since Ci^i is a sink cycle, 
there are no arcs from Ci^i to its complement; likewise, there are no arcs leading 
to C2,i from its complement. To avoid overcounting we consider an isolated cycle 
only as a source cycle. Thus we have only the constraint 5i > 1 , 1 < i < ^1 • For 
i ^ A,\ei 5i>l, A^ > 1 denote its in-degree and its out-degree. Clearly 

(4.8) ^6i + ^5i = m-a, J]] A^ + ^ A^ = m - a. 

By (1.5), the total number of the {m — a) -long insertion sequences that result 
in a multi-digraph on the vertex set {1, . . . ,^i,£i + 1, ... ,^1 -|- ^2, M \ ^} > with 
the in-degrees (5i, . . . , 5^^ , 8i, {i £ [n]\ A), and the out-degrees Ai, . . . , Ag^ , A^ , 
{ie [n]\A),is 

(("'-o^mifi^ n w 

«=1 * 1=1 * t€[n]\A 

We need to multiply this count by 

ii Hi 

since, say, for each of bi arcs ending at the "aggregated" vertex, that represents 
the cycle C\^i , there are a\^i = \Ci^i\ ways to specify the arc end. There is another 
factor still missing, namely {m)m-a , which counts the total number of ways to 
select those (m — a) positions in the whole ?Ti-long insertion sequence when a 
newly added arc belongs to that multi-digraph, rather than to one of the cycles. 
By (4.8), and the constraints on 5i, Si,Ai, the resulting product equals 



m\{m - a)\ [x^*"'^] (e^^ - l)""" JJ(e''i-'^i - 1) 
(4.9) 



i=l 
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Further, the product of the factors in (4.7) and (4.9) dependent on ai^i, a2,i , equals 

1 ii. (e«l..^l _l)[(l + u;)«l,. -1] 

(4.10) 



i = l 



ai,i 



1 i^ ea2,ia;i[(l + y;)a2,. _ ^j 



^2! 



i=l 



ai 



Summing the products (4.10) over Uij > 2 meeting the constraint (4.4), and then 
over ah ii , £2 we get 



iy^^Ei-AE 



1 /^-^ Je^^i - 1) [(1 + u;)'' - 1] 



(4.11) 



^i>0 "^' \fe>l 



£2>0 \6>1 



;''^2[(l + i(;)^-l] 



[y1 exp 



^^, (.>-.-i)[(i + .,)--i] ^^^. 



,b^2 rn , ^)6 _ 1] 



[(1 + 



6>1 

= [y'^]F(x,y,l+w;), 

see (4.2) for the definition of H{x, y, z) . 
Collecting all the pieces, we write 



b>l 



E 



""'^^ [^'] E 7^ [(xiX2)-1 i^(x, y, 1 + ^) n(e- - 1)"-. 



h{n,m 



[m] 



i=l 



Consequently 



E[{l + w)^] ="^w''E 



(ml) 



1^2 



h{n,m) ^-^ {m)a 
which is equivalent to (4.1). D 



X1X2) 



\ i=l J 



Armed with this lemma, we obtain an asymptotic formula for V{X = 0) . 

Theorem 4.2. For the random multi- digraph MGi^i{n, m) , with r = m — n ^ 00 , 
m = 0{n) , 



(4.12) 



P{X = 0) 



A 
" e^-1 

A 
e^(e^ - 1) 



[l + 0(r-Mn r)) 
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with A determined by Xe'^/{e^ — 1) = m/n. 

Proof of Theorem 4.2. In the formula (4.3) for P(X = 0) , 

2 2 2 

(1 - 2/)-i n(i - 2/e^o = 1 + Y[{^^' - 1) E 2^" - [1 + E(«"' - 1)] y 

i=l i=l a/0,1 i = l 



/ii,i(n,m) = (m!)2([a;'"](e^ - 1)")^ 



and 

So (4.3) becomes 

(4.13) P(X = 0) = 1+ 5] M!Lq2(^^^_i)_ n(Q(l^l)^Q(1^0))2^ 

l<a<n 

where 

_ [x— "] (e- - 1)-'' 

^^'^•" [x-](e--l)- ■ 
As in the proof of the theorem 2.1, for a,b < n, 

n( h\^ A° / n {n)a (ny ( e^ - I 

Q(a,b) <b -7-T TTT-J- T, 7—r<{— - 



{e^ — ly \j n — b^ {rn)a ~ ^mJ \ Ae 
and consequently, setting 

21nr 



after simple estimates we have 



X 



Q(r n In r) 



(4.14) y^ ^2^ Q^U a-l)<h e"^"^ min{l. A} < r'^ min{l. A}. 



(m 



{We compare the series tail with min{l. A} because the latter is the order of the 
RHS in (4.12).) 

Turn to a < An . Let us have a close look at [x"*"**] (e^ — i)"+i-'» ^ the numerator 
in the fraction Q{a,a — 1) . Introduce A, satisfying 



xe m 



e^ — 1 n + 1 ' 
and the independent positive Poissons y* = y*(A). Using (2.7) we have 

/ A_lNn+l-a /"+!-'' \ 

[a;"^-°] (e^ - 1)"+!-" = ^^ ,J_^ p Y^ Y^ = m-a] 

/n+l \ 



^ ^ P( J]y^=m) (l + 0(an-i+aVn-2)) 
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A bit of calculus, based on (3.13), shows that A = A + 0{n~^) . Consequently 



(l + 0(r-^+n-^a)), 



\m — a \m-a 

and 

1 + (J[r 
^^^^ / v-"v"'+l)Var[; j; . ^.^^ 

Therefore 

A' 



P yy^=m = ^^^^"^ I = (l + 0(r-i))P yy^ = m 

^^ I ^2^(n+l)Var[y])i/2 ^ ^ ^' \ A^ 



Q{a, a - 1) = (l + Oir-^ + an'^ + aVn-^)) ■ 



(gA_l)a-l 

(4.15) =(l + 0(r-Mn2r))- 



(gA_ l)a-l 



uniformly for a < An ■ 

Also, uniformly for a < A^ , 



Using (4.15)-(4.16), m/n = Xe^/{e^ — 1), and denoting a = X/[e^{e^ — 1)], we 
compute: 



(4-17) E }^Q\a,a-l) = {e'-lf[ Y. cx'^ | (l + O(r-MnV)) 

(jfe^-l)2 



l<a<A„ ^ ' \l<a<A„ 



1-0- 



■[l + 0(fJ^") + 0(r-MnV)] 



1-0- 
as 



''^'' ^^ (l + 0(r-iln2r)). 



O" 



<g-(l-a)A„ <g-AA„ <^-l^ 



For the last, negative, term in (4.13) we need to be more precise. Picking a simple 
contour L enclosing the origin, by Cauchy integral formula we have 

1 /■ Ce^ - 11" 

L 

1 n / (e^ - l)'^-ie^ , 



27ri m J z"^' 

L 

— {[x""-^] (e^ - 1)" + [x^-i] (e^ - 1)"-^|. 
m 



So, recalling the definition of Q{-, •) , 

777 

(4.18) Q(i,i) + Q(i,o) = -. (!) 

n 
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Combining (4.14), (4.16) and (4.17), we easily transform (4.13) into (4.12). D 

Turn now to the random digraph Gi^i{n,m) . Let Pn,m denote the probabihty 
that Gi^i{n,m) has no simple sink-sets and source sets. The theorem 4.2 provides 
the asymptotic estimate for Vn,m = PiX = 0) , the corresponding probability for 
the random multigraph MGi^i{n,m) . We can write 

\S*\ 
■p 

' rt,.m 



hi^i(n,m) 



here S* is the set of all m-long sequences of arc-insertions that result in a multi- 
digraph without the simple sink/source sets. Denoting by S{s) and A(s) the 
in/out degrees of a multi-digraph corresponding to a generic sequence s, we see 
that the total number of the graphic sequences s leading to a digraph without 
simple sink/source sets is 



hf^,{n,m):= J] F(d(s), A(s)). 



By (1.22),andP„,^ = G(r/n), 



ce„2Ts 



= e"'' Vn,mhi^i{n, m) (l + ©(r^n'^/^+T) + ©(ne"''""'^ )) 
= e-''Pn,^/ii,i(n,m)(l + 0(r^n-i/2+7)). 



Therefore 

if I 77 777 1 

(4.19) Pn,m = ''') ' = e-^Vn,m{l + 0{r'n-'/^+^)) , 

ni,i(n, m) ^ ' 

for e > and 7 G (0, 1/2) . 

Combining (4.19), (4.12) and recalling the formula (1.12) for rj, we have proved 
the following. 

Theorem 4.3. For r — t- 00 and m = 0{n) , 

A ^' 
1 



\ e^ — I J / m X^ 

-^71,771 — \ exp I — — 

^ A \ n 2 

~ e^(e^ - 1) 
(4.20) X (1 + 0{r-^ In^ r + r^n-^/^+T)) ^ 

for e>0 and 7 € (0, 1/2) . 

By (4.20), Pn,m = 6(A) = e{r/n) » n"! . In light of Note 3.2, (4.20) combined 
with Theorem 1.2 complete the proof of our claim (1) in the introduction. 

Acknowledgement. Back in 2001 , during my memorable visit to Nick Wormald 
at University of Melbourne, we spent most of our time together working on the 
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alternative derivation of the Bender-Canfield-McKay formula for the count of con- 
nected sparse graphs, the remarkable formula that struck us by how sharp it was. 
It occurred to us then that the embedding idea might also work for counting the 
strongly-connected directed graphs. It was immediately clear though that an in- 
trinsically harder notion of strong connectivity presented a difficult new challenge. 
In Summer of 2009 Nick informed me that together with Xavier Perez they had 
made a further progress toward obtaining a fully-proved asymptotic formula for the 
counts of strongly-connected digraphs . In particular, this formula would yield a 
directed counterpart of the BCM formula, but without an explicit estimate of an 
error term. Reenergized by the news, I set up to see if I could obtain a sharp as- 
ymptotic formula, with a remainder term qualitatively matching that of the BCM 
formula. The approach in this paper is naturally very different from the one used 
by Perez and Wormald. However the paper would not be possible without the joint 
work with Nick on the BCM formula, and our initial attacks on the directed case. 
I owe to Nick my debt of gratitude. 
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